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Abstract 


In this thesis algorithms are proposed for some graph problems. A new parallel 
algorithm that runs in O(logn) time with 0(n) cost is proposed for the problem of 
updating the minimum spanning tree of an undirected underlying graph. A bounded 
incremental algorithm is also proposed for this problem. Edge insertion and vertex 
insertion updates are considered. 

Constant time parallel algorithms are proposed for testing whether a given graph 
contains a triangle. Algorithms are also given to recognize (b) claw free graphs and 
(c) for listing all simplicial vertices in a graph. 0(log n) time algorithms are proposed 
for testing presence of a incomparability graph on 6 vertices, a forbidden subgraph 
for partial-3 trees. 

Existing optimal algorithm for computing the maximal matching of a general 
graph in parallel takes 0(log^ n) time. This also matches the parallel time for com- 
puting the maximal matching in a bipartite graph with optimal work. There is also a 
nonoptimal algorithm which runs in time 0(log^’® n) with 0[{m+n) log^ n) w'ork. We 
conjecture that computing a maximal matching in bipartite graph should be faster 
than in a general graph and also prove that the parallel time complexity of maximal 
matching in general graph is 0(loglognt{,(n, m)-i-£so(n. m)) where ti,{n,m) is the time 
for bipartite graph and tso{n, m) is the time for a nonoptimal algorithm for general 
graphs. Thus, using our result any improvements in the bipartite matching algorithm 
would reflect in improvement in the parallel time complexity of the problem in general 
graph. 

For edge colouring of graphs we give a constant time algorithm to produce a 
valid log n colouring using the bipartite decomposition of a given graph into edge 
disjoint bipartite graphs. We also reduce the number of colours to 0(A“logfc if we 



are provided with a vertex colouring using k colours. 

For numbers drawn forni a restricted universe, called integers, we propos a sub- 
linear time sequential algorithm to compute the minima. The time taken by the 
algorithm depends on the word si2e of the machine and the size of the universe from 
which the integers are drawn. A parallel algorithm is also obtained for this problem. 



Acknowledgments 


I express my sincere thanks to Dr. Sanjeev Saxena for his guidance during the course 
of this work. His valuable advice has helped me in many situations. His patient 
reading of the report has uncovered many typos that crept. 

I also wish to thank my friends, SriRam, Madhu, Kiran and Yugandhar for many 
discussions with them for the thesis and off the thesis. .-Vll of them paid a willing ear 
many a time during this work. The KLC trio also have given me a good company. 
Friends of M. Tech96 batch also made my stay during this course a happy one. The 
lively atmosphere of Hall 4 is hardly forgettable. The evenings at Hall 4 quad remain 
the happiest ever in my life. 

I would also like to extend thanks to my family members for encouraging me and 
supporting me during the last six years of my stay outside home. 

Last but not the least, I would like to thank all those who directly or indirectly 
contributed towards the successful completion of this work. 

For any of the typos or errors I extend my apologies. 


VI 



Contents 


1 Introduction 1 

1.1 Introduction 1 

1.1.1 Preliminaries 1 

1.2 Model of Computation 3 

1.3 Oven'iew of the thesis 4 

2 Parallel and Bounded Incremental Minimum Spanning Tree 6 

2.1 Introduction 6 

2.2 EREW MST Update 7 

2.3 Bounded Incremental .Algorithm 12 

2.3.1 Bounded Incremental Computation 12 

2.3.2 Bounded Incremental Algorithm for MST Update 13 

2.4 Single \ ertex Update Problem 18 

2.5 Conclusions 19 

3 Efficient Parallel Recognition of Small Induced Subgraphs 20 

3.1 Introduction 20 

3.2 Recognition of Triangle 21 

3.2.1 Recognition of Claw Free Graphs 23 

3.3 Subgraphs on Six \ enices 25 

3.3.1 Recognition of /Ce . . 25 

3.3.2 Recognition of .l/e 26 

3.4 Listing All Simplicial Vertices 27 

vii 



4 On the Parallel Complexity of Maximal Matching 30 

4.1 Introduction. 30 

4.2 Reduction 31 

4.3 Final Remarks 34 

5 Parallel Edge Colouring of Graphs 35 

5.1 Introduction 35 

5.2 Definitions 37 

5.3 Edge Colouring Graphs in 0(1) Time 37 

5.4 Edge Colouring of General Graphs 39 

5.5 Conclusions 40 

6 Minima in Sublinear Time 41 

6.1 Introduction 41 

6.2 Sequential Algorithm 42 

6.2.1 Minima of two words 43 

6.2.2 Minima of n integers 45 

6.3 Parallel Algorithm 45 

6.4 Practical Issues 46 

7 Conclusions 48 

7.1 Conclusions 48 

7.2 Further Work 49 


viu 



List of Figures 


1 (a) Claw graph, (b) Incomparability graph on 6 vertices and (c) For- 
bidden subgraph on 6 vertices for partial 3 trees 2 

2 The “left”, “right”, “above” and “below” regions of node u 9 

3 Identification of unique path bem-een a pair of vertices u and u in a 
tree T. The number to the left in the box associated with each node 


gives it preorder number and the number to the right is its postorder 
number 10 

4 The minimum spanning tree T along with the edge added shown in 

dotted lines. The set of AFFECTED edges are shown by a cross across 
them. The AFFECTED vertices are shown with a over them. ... 14 

5 Updating the parent pointers along the path from u to v. The dashed 

edge is the edge that leaves T and (u, v) enters the tree 17 

6 Showing the formation of cycles during single vertex update. The bold 
edge coming from z is the tree edge at z and the dashed edges are the 
edges that induce cycles in T. The weights of the edges are not shown 


in the figure 19 

7 Claw graph. The vertex v of degree 3 is called the central vertex. . . 23 

8 The graph ICe- It is a incomparability graph on six vertices 25 

9 The graph A/e- This is a forbidden subgraph for partial-3 trees on six 

vertices 27 

10 Merge does not give a maximal matching 32 

11 The bad case. The vertices a, 6, c will not be covered by the matching 33 

12 A 4-edge colouring of graph G. The numbers at each edge represent 

the colour assigned to that edge 36 


IX 



13 Format of a machine word 


43 


X 



Chapter 1 


Introduction 


1.1 Introduction 

Parallel processing involves employing more than one processor to perform a given 
task. There is a strict upper bound on the processor speeds of traditional uniproces- 
sor s>-stems. This bound is dictated by the speed of light and the minimum distance 
required between two components so that they do not interact with each other. Hence 
hardware technology alone may not satisfy the ever increasing demands of computa- 
tional power. Thus parallel computers have emerged as an alternative to overcome 
this problem. 

An algorithm is a sequence of steps directing the computer to solve a particular 
problem. .A sequential algorithm specifies the actions to be taken by a single processor 
for solving a problem. In a sequential algorithm only a single instruction is executed 
at any time. In a parallel algorithm, the algorithm is executed by more than one 
processor simultaneously thus reducing the computation time. But however we have 
to extract the parallelism inherent in the problem to obtain good parallel algorithms. 
The next subsection describes in brief the problems chosen for this thesis. 

1.1.1 Preliminaries 

In this section we discuss the problems considered in this thesis in more detail. 

• Updating MST: The problem involves updating the minimum spanning tree 
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(a> (b) 

Figure 1; (a) Claw graph, (b) Incomparability graph 
subgraph on 6 vertices for partial 3 trees. 



on 6 vertices and (c) Forbidden 


of an underlying graph G under edge insertions. That is, given a minimum 
spanning tree T of the graph G under some weight function W : E R and 
an edge e = (u, v) with weight ^((u. u)), find the new minimum spanning tree 
T' of the graph G' = GU {(u, u)}. We present an O(logn) time 0(n) w'ork 
EREW algorithm for this problem. We also consider an alternative complexity 
measure for analyzing dynamic algorithms [44]. 

• Small Subgraph Recognition: This centers around identifying for the presence 
of certain subgraphs in a given graph. We consider triangle, claw graph, incom- 
parability graph on 6 vertices. The claw' graph, incomparability graph and the 
forbidden subgraph for partial 3-trees are shown in Figure 1. We also consider 
the problem of listing all simplicial vertices in a graph. A vertex is called a 
simplicial vertex if it’s neighbourhood is a complete subgraph. 

• Maximal Matching; In this problem we are interested in the parallel complexity 
of maximal matching in a general graph. The study is motivated by the opinion 
that the case of bipartite graphs is simpler than general graphs. We derive an 
expression in terms of the time complexity of computing maximal matching in 
a bipartite graph. 

• Edge Colouring: The problem of edge colouring graphs is to assign colours to 
the edges of a graph G with the restriction that no two edges sharing a common 
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end point are assigned the same colour. This finds applications in scheduling 
problems. 

• Integer Minima; Here we are interested in finding the minima of a set of integers, 
drawn from a restricted domain in less than linear time. We also give a parallel 
algorithm for this problem. The time requirements depend on the word size of 
the machine and the range of the universe. 


1.2 Model of Computation 

To analyze algorithms, we need a abstract model of computation on which ail the 
cost and space requirements of the algorithm can be expressed and compared against 
existing ones for answering questions of efficiency. In this thesis, we use the Paral- 
lel Random .A.ccess Machine (PRAM) as the model of computation for the parallel 
algorithms. PR.A.M is the parallel analogue of the unit cost sequential random ac- 
cess machine. The model involves N processors numbered from 0 to iV - 1. All the 
processors have access to a memory consisting of cells numbered from 0. Further, 
the processors are assumed to be aware of their number which is also called its id. 
Depending on the way simultaneous access to a same memory location by more than 
one processor is allowed, the family of PRAM’s is classified into 3 classes as below. 

. Exclusive Read Exclusive IFrite(EREW): In this model, at any particular in- 
stant no more than one processor is allowed to either read or write to the same 
memor\' location. The algorithm designer should write algorithms in such a 
way that such conflicts never occur. 

. Concurrent Read Exclusive Write(CREW); In this model simultaneous read of 
a memory- location by more than one processor is allowed but simultaneous 
write to the same memory location in forbidden. 

• Concurrent Read Concurrent Write(CRCW): This model allows both simulta- 
neous read as well as write of same memory locations by more than one pro- 
cessor. According to the resolution method of Concurrent write this is further 
subclassified into the following models. 
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- COMMON; All processors writing to a same location should write the 
same \’alue. 

— PRIORITY: The smallest numbered processor succeeds in the write. 

- ARBITRARY; One processor is gaurenteed to succeed but no commitment 
is made as to which processor succeeds. 

A parallel algorithm that runs in time C>(t(n)) using p(n) processors is said to 
have a cost of c(n) = 0{t{n)p{n)). If this cost equals the cost of the best known 
sequential algorithm, then the parallel algorithm is said to be work optimal [30]. 

In chapter 6, we however deviate from this model of computation and use the 
nonconservative RAM model. In this model, the word size supported by the machine 
is bigger than in the R.A.M. Specifically, the word size of the machine is assumed to 
be logarithmic (upto a constant) in input. Thus for inputs of size of N, RAM is 
supposed to have a word size of 0(logiV + logn) where n is the size of the problem. 
The nonconser\'ative RAM is assumed to have a word size of 0(logiVlog‘^n) for some 
constant c. This model too has a parallel analogue named the nonconservative PRAM 
and all the concepts and classifications of PRAM apply to this model also. 


1,3 Overview of the thesis 

The rest of the thesis is organized as follows. In Chapter 2. we discuss the algorithms 
for updating the minimum spanning tree of an underlying graph subject to edge in- 
sertions. In Chapter 3, we present the algorithms for recognition of small subgraphs. 
Algorithms are presented for recognition of triangle, claw free graphs, an incompara- 
bility graph on 6 vertices, a forbidden subgraph for partial 3— trees and also for listing 

simplicial vertices in a graph.{See Figure 1) 

In Chapter 4 we consider the problem of computing a maximal matching in a gen- 
eral graph. We obtain an expression stating that this problem requires O(log log ntb{n, m) 
tso{n,m)) time where ti,{n,m) is the time for bipartite matching algorithm and 
tso(n, m) is the time taken by the existing nonoptimal algorithm for general graphs. 
Since our result is optimal, any new optimal algorithm for bipartite graphs results m 
a better optimal algorithm for general graphs also. 
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In Chapter 5, we present the algorithms for edge colouring of graphs. Our algo- 
rithm runs in constant parallel time and produces a A^logn) colouring. We show' an 
explicit construction for this. 

Chapter 6 differs significantly form the above chapters in that the model of com- 
putation we use is the nonconservative RAM (and nonconseivative PRAM) where the 
word size of the machine is assumed to be substantially bigger. We make use of this 
inherent power in the machine to obtain a sublinear sequential algorithm for finding 
the minima of n integers. We also present a parallel algorithm for this problem on the 
parallel analogue of this nonconseivative RAM which is called the nonconservative 
PR.\M. 

In Chapter 7 we offer some concluding remarks and state some of the possible 
extensions from our work. 
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Chapter 2 

Parallel and Bounded Incremental 
Minimum Spanning Tree 

2.1 Introduction 

Graphs play a key role in many problems related to computer science because of their 
numerous applications. Dynamic graph problems too have received great attention 
[17]. Dynamic problems involve updating the solution subject to changes in input 
without building from scratch. In this chapter we consider the problem of updating 
the minimum spanning tree of a graph after edge insertions. More formally, we define 
the problem as follows. Given a graph G = (V' E) with a weight function W : E R 
and a minimum spanning tree(MST) T of G, update the MST when an edge e = {u,v) 
with w'eight W{e) is added to G. According to terminology' of Eppstein et. al. [17] 
the problem can be called partially dynamic incremental MST update. 

The problem of updating an MST has good algorithms in the sequential model 
of computation. Fredrickson [21, 20] use a data structure called topology trees which 
supports update operations in 0{\/^ time. Some special cases when the underlying 
graph is planar are also considered by Eppstein et al.[18]. 

The problem of MST update on a PRAM has been extensively studied. Pawagi 
and Ramakrishnan [43] consider MST update subject to two modifications namely 
increase and decrease in edge weight and adding a vertex z along with edges incident 
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from z. The} present 0{iogn) time algorithm on CREW PRAM with O(n^) proces- 
sors for both these problems. The same algorithm also supports edge insertions as 
well as deletions within the same bounds. Although their algorithms are very much 
uniform in nature for all the problems, they are very costly because of the amount of 
information that they compute in the form of tables and M^) for every update. 
Johnson and Metaxas [32] considered the single and multiple vertex update problem. 
They present 0(log n) time 0(n) cost algorithms for the single vertex update problem 
on the EREW PRAM. Here we present an O(logn) time 0(n) cost EREW algorithm 
for the single edge insertion MST update problem. We thus improve the algorithm 
of Pawagi and Ramakrishnan [43]. 

In the second part of the Chapter, we treat the same problem and present a 
bounded incremental algorithm for updating the minimum spanning tree of an un- 
derlying graph under edge insertions. The complexity of the algorithm is measured in 
terms of the amount of change that the current solution has to undergo. This param- 
eter is denoted by P|| and the requirements of the algorithm is analyzed in terms of 
size of 1|<5|| denoted by |5|. This complexity measure was introduced by Ramalingam 

[44] for analyzing dynamic algorithms. 

The rest of the Chapter is organized as follows. The EREW algorithm with the 
same time and cost bounds is presented in Section 2.2. In Section 2.3, a bounded 
incremental algorithm for this problem is presented. This algorithm does work linearly 
proportional to the amount of change.a In Section 2.4 we consider the case of single 
vertex update. 

2.2 EREW MST Update 

In this section we present an EREW algorithm to update minimum spanning tree in 
O(logn) time and 0{n) cost. The algorithm uses routines for numbering the vertices 
of a tree in preorder and postorder [46] and LC.A computation of a pair of vertices 

[45] . These operations can be done in O(logn) optimal EREW time. Without loss 
of generality, we assume that for the edge (u, u), u occurs before v in the preorder 
traversal of the tree. 
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The key idea in this algorithm is to locate the edges which are part of the cycle 
formed after the new edge has been added. Preorder and postorder numbering of the 
vertices of the tree is used to identify the unique path between the vertices u and v 
in the original minimum spanning tree T. Our discussion is based on the Figure 3. 

If u is an ancestor of v, the nodes in the unique path between the given pair of 
vertices u and v are the vertices whose preorder number as well as the postorder 
number lie in between the preorder and postofder number of the vertices u and v. 
(See Figure 3 and Theorem 2.1.) 

Theorem 2.1 In a rooted tree T, for any two vertices u and v such that u is an 
ancestor of v, a node is in the unique path between the vertices u and v if its preorder 
number lies in the range [preorder (u), preorder (v)] and its postorder number lies in 
the range [postorder(v),postorder(u)J. 

Proof: We first prove the following result. In a rooted tree T, a node r is an 

ancestor of node u if the preorder number of r is less than the preorder number 
of u and the postorder number of r is greater than the postorder number of u i.e. 
preorder(u) < preorder(r) and postorder(r) > postorder(u). To prove the above 
statement, we first define “left”, “right’, "above' and “below” order on the nodes 
consistent with the embedding of the tree T . By the definition of the preorder number, 
any node r which has preorder number less than the preorder number of node u must 
be either “above” u or a “left” sibling of u or a node in the subtree rooted at one of the 
“left” siblings of u. Similarly, by the definition of postorder numbering, a node r with 
postorder number greater than the postorder number of u is either a node "above” 
u, or a “right” sibling of u or a node in the subtree rooted at the “right” siblings of 
u. Thus, the set of nodes satisfying both the conditions is the set of ancestor nodes. 
(See Fig 2.) 

We continue with proof of the theorem. Consider nodes u and v. The set of 
ancestors of v are a subset of ancestors of node u. Moreover, the set of ancestor 
nodes of v have preorder number less than that of v and postorder number greater 
than that of v (from the above result). The same holds for ancestors of u also. -\s u is 
an ancestor of v, preorderiu) < preorder{v) and postorder{u) > postorder{v). Thus 
the nodes in the path from u to u have preorder number which is greater than that 


8 



above 



of V and less than that of u. Similarly, the postorder numbers of the nodes between 
u and V lies in the range [postorder(tt),postorder(u)]. Thus if we consider the nodes 
with the numbers in the range specified by the theorem, we will be left with the set 
of nodes between u and v. I 

Theorem 2.1 gives an efficient way to compute the path between a node and its 
ancestor in the tree T. If it is the case that neither is an ancestor of the other, 
then observe that the path from u to u is the concatenation of the path from u to 
LCA{u, v) and LCA{u, v) to i\ 

An example is shown in Figure 3. Here u and v are neither an ancestor nor a 
descendant of the other. So we compute LCA{u, v), which is n, and compute the two 
paths from x to u and from x to v. The path is shown in heavy lines. 

These observations lead to the following algorithm. 

procedure EREWX'pdate 

comment Given a graph G = {Vt E), b. rooted minimum spanning tree T of G 
and an edge e = (u, v) being added to G. 
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Figure 3; Identification of unique path between a pair of vertices u and u in a tree 
T. The number to the left in the box associated with each node gives it preorder 
number and the number to the right is its postorder number 
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begin 

Step 1. for every node of the tree T obtain the preorderQ and 
postorderO numbers of the nodes and the parent P() of each node. 

Step 2.0. let x = LCA{u,v) 

Step 2.1for every vertex r do in parallel 

if { preorder(r) € (preorder(x),preorder(t;)] and 
postorder(r) € [postorder(i;),postorder(a;)) ) 

InJPath(r) = True; 

endfor; 

Step 2.2. if LC A{u,v) ^ u then compute the path between LCA{u,v) and 
u as in Step 2.1. 

comment Identify the edge to be removed. Note that if vertex r 
is in the path, then the corresponding tree edge is (r, P(r)). 
comment Array M of size n with each entry' initialized to — oo is used. 

Step 3. for all vertices r do in parallel 
if In JPath(r) then 
M[Preorder{r)] := Wt{r, P(r)); 

Step 4. Find the maximum value in the array M and remove the corresponding 
edge from the tree. 


A simple analysis of the algorithm shows that Steps 2,3, and 4 can be done in 0(1) 
time with 0(n) processors and Step 1 in O(logn) time and 0{n/ logn) processors. So 
the whole algorithm takes time of 0{logn) with 0{n/logn) EREW processors and 
the following Theorem follows. 

Theorem 2.2 The minimum spanning tree of a graph G = (F, E) can be updated 
in 0{n/p + logn) time with p processors on a EREW-PRAM model when an edge 
e = (u, v) with weight IV (e) is added to G. I 
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2.3 Bounded Incremental Algorithm 

In this section, we present a sequential algorithm to update the minimum spanning 
tree of a underlying graph G subject to edge insertions. The time complexity of the 
algorithm is measured in terms of the amount of change that the current solution has 
to undergo in order to correctly reflect the new state. Ramalingam [44] introduces 
this notion of complexity’ measure for incremental algorithms and gives incremental 
algorithms for several graph problems where complexity is measured in terms of the 
parameter namely the amount of change denoted by ||(511. In the subsection 2.3.1 we 
briefly discuss this measure of complexity and the algorithm is given in subsection 
2.3.2. 

2.3.1 Bounded Incremental Computation 

In general, an incremental graph problem V takes as input G, the solution to graph 
G and input change 5. The job of the algorithm is to compute the solution for the 
graph G U {6} where GU {5} is the graph obtained by making a change 5 to G. Here 
we measure the time and space requirements of the algorithm not in terms of the 
conventional size of input but as a measure of the amount of change in input and 
output required to modify the solution. 

More formally, a vertex ?i e G or G U {<5} is said to be a modified vertex if 6 
inserted u or deleted u. (In our case, only insertions need be considered.) The set 
of modified vertices is denoted by the set MODIFIEDg,^. 5 creates a set of vertices 
that are affected by the change. With reference to the problem under consideration 
a vertex is said to be affected iff that node is in path(u. v) in the given minimum 
spanning tree T. We call this set as AFFECTEDg. 5 . Similar definitions for the set 
of edges modified and affected can be given. Let CHAXGEDg ,(5 = MODIFIEDc^j u 
AFFECTEDg, 5 - This set captures the change in input and output. When there is 
no confusion, we can do away with the subscripts. Finally, H^H is the union of the 
sets capturing CHANGEDj of both V‘(G) and E{G). This parameter H^jj captures 
the amount of change the current solution has to undergo for accounting to change 
in the underlying graph G. The complexity of the incremental algorithm is measured 
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in terms of the size of this parameter namely |(5|. 

An incremental algorithm is said to be bounded if the time taken by the incremen- 
tal algorithm to process a change 6 to the problem V is bounded by some function 
Otherwise it is said to be unbounded. A djuamic problem is said to 
be (un)bounded if it has(resp. doesn’t have) a bounded incremental algorithm, that 
is there is an(no) incremental algorithm that runs in time which is a function of 5 
independent of the input size. 

For a measure of efficiency among incremental algorithms, classification may be 
based on the time requirement in terms of the parameter |5|. That is, an incremental 
algorithm is said to be polynomially bounded if its time complexity can be expressed 
as a polynomial of 5. Otherwise, it is said to be exponential..'\s can be inferred, in- 
cremental algorithms which have complexity of low-order polynomial of 5 are “good” . 

In the next section we apply these ideas to obtain am 0(15]) incremental algorithm 
for the update of minimum spanning tree. 

2.3.2 Bounded Incremental Algorithm for MST Update 

In this subsection we assume that the input is in the form of (node,parent) pairs. The 
assumption that the input tree T is rooted is vital as will be explained later. By a 
rooted tree we mean that each node knows its parent. Moreover, after the algorithm 
ends, the output that we generate should also be a rooted tree. 

If an edge e = (u, u) is added to G whose MST T is to be updated, we can 
define MODIFIED^ = {u, u}. Similarly, the addition of this edge {u,v) may result in 
change of parent pointers of nodes in the u, u path in the tree. Thus, AFFECTED^ = 
{r|r 6 T and r lies in the unique (u, v) path } where T is the input to the problem in 
the form of a rooted tree. The set of affected vertices and edges are shown in Figure 
4 after insertion of the edge {u,v). 

.Applying the notion of complexity measure in terms of amount of change 5 to our 
problem, we note that the value of 5 is given by the length of the path between the 
vertices u and v in the current minimmn spanning tree T. The length of the path is 
given by Preorder(u) -i- Preorder(u) - 2* Preorder (LCA(u, u)) - 2. 

Computing the path using Theorem 2.1 takes 0(d) time for this part. But the 
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Figure 4: The minimum spanning tree T along with the edge added shown in 
dotted lines. The set of AFFECTED edges are shown by a cross across them. The 
.AFFECTED vertices are shown with a over them. 

computation of preorder and postorder numberings cannot be generated within the 
same bounds. We instead avoid those explicit computations. We depend on the LCA 
computation of two nodes it and u in a rooted tree T. That is, traverse the path 
from V and it along their parents using knowledge of P(), every time checking for the 
LCA(it,t;) node. This process takes time which is 0(<5), 25 to be precise. During this 
process, we need to keep track of whether a node is seen by it during traversal or 
not. Thus during each step, the node visited by u is flagged and v does a checking 
operation for the node it visits most recently. So, it is assumed that each node has 
another unit space for storing this flag. As we keep track of the path traversed by u 
during this path computation, w’e can at the end of the algorithm reset these flags. 
Effectively, tve do not use any extra space that is to be charged to the algorithm. We 
use only 0{6) space for this whole process. To obtain the complete u, v path, we need 
to concatenate these t-wo paths, (see Aho et aZ.[l] for data structures to suppon such 
operations). This path is stored for later use. It is during this computation that the 
knowledge of parent is required. Otherwise, we cannot do this computation in the 
stated time bounds. 
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In the obtained path after we find the heaviest edge and compare its weight with 
ir(e). One of them remains in the minimum spanning tree and the other leaves the 
tree. We now address the restructuring of the tree T after this update. There are 
two cases. The trivial case is when T undergoes no change and clearly nothing needs 
to be done. In the other case, let the tree edge {p,q) leave the tree. Let (p, g) be 
in the path from LCA{u,v) to v. We assume that p = P{q). Now we update the 
parent pointer of all the nodes in the path g to u as follow’S. Label the nodes in the 
path from g to u as g = go, gi, ■ • - , gjk = u- Earlier, P{qk) = gjt-i and P(gt) = qi-i for 
I < i > k — 1. Now, we set P(i’ = g^) = u and P(gi-i) = gi for 2 < i < fc — 1. (See 
Figure 5.) In the other case, if an edge in the path from LCA{u,v) to u is the edge 
leaving the tree, then the following restructuring has to be done. Let the edge (p, g) 
leave the tree where p = P(g). Label the path from g to u by g = go, gi, g 2 , • • • , gt = u- 
Set P{gi) = gi+i, 0 <i < k and P{qk) = 

The above discussion can be captured in the following algorithm. 

Algorithm Update_MST(G,T,e): 

Input: The underlying graph G. the current minimum spanning tree 
T and the edge e = (u, v) being added to G. 

Output: The new minimum spanning tree, 
initialize cur.u = u and cur_v = u; 
mark(cur.u) = 1; mark(cur_v) = 1; 
index = 0; 

comment The while loop computes the unique u-v path in T . 
while not done 

cur.u = P(u); 
mark(cur_u) = index; 
index = index + 1; 
add cur.u to u.path: 
cur.v = P('u); 
add cur.v to v.path; 
if mark(cur_v) = 1 then 

concatenate the paths u-path(l:mark.u(cur_v)) and v.path to obtain the 
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u-v path in the tree T and store the vertices in the path in path.uv; 
done = true; 
end {while} 

find the maximum cost edge among the edges (r, P(r) for r G path.uv 
and remove that edge; 
comment: restructure the tree 

if the edge (p,q), p = P{q), in the path LCA{u, v) to v is the leaving edge 
Label the nodes in the path from q to u as g = 9i) • • • > 

set P{v = qk) = u; 
for 2 < i < A: - 1 do P{qi-i) = Qi ; 

if the edge (p, 9 ), p = P{q)i in the path LCA{u, u) to u is the leaving edge 
Label the path from qto uhy q = qo, qi, qz, ■ ■ ■ ■, qk = w, 

P{Qk) = u; 

for 0 < i < A: do P{qi) = qi+u 
end. (Algorithm) 

From the above discussion, the following theorem follows. 

Theorem 2.3 Partially Dynamic Incremental MST Update of a general underlying 
graph is incrementally bounded. In particular, there is an 0(6) time algorithm for 
performing the update of a MST under edge insertions using only 0(<5) space. M 

Thus after the algorithm ends, we have restructured the input tree to be a rooted 
tree. To be more precise, we start with a tree in which every node knows its parent 
and after the update, the same holds true. This restructuring is also important as 
rooting a tree takes time proportional to the number of nodes in the tree which our 
algorithm cannot afford to. Moreover, we make use of extra space associated wdth 
each node during the algorithm and at the end of the algorithm clear the space. If this 
clearing is not done, the space requirement grows to 0{n) or the time requirement 
grows to 0{5 log 5). If there is a data structure that supports Insert(a:) and LookUp{x) 
in constant time where the values that are stored in the data structure are in some 
suitably restricted domain, then we need only 0(5) space and time. There are data 
structures that support constant time lookup on static tables, (see [47]). 
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2.4 Single Vertex Update Problem 

In this Section, we give a bounded incremental algorithm for updating the hlST of 
an underlying graph after a vertex z along with edges from z are added to the graph 
G. The edges from z are associated with weights from the weight function PF. The 
algorithm is an extension of the algorithm presented in Section 2.3 to the case of 
single vertex update. 

We have to first find the minimum weight edge among the r edges incident from 
z and make it a tree edge. This does not violate the minimality of the MST of the 
new graph because of the property of the hlST that from every vertex the edge of 
minimum weight will always be in the MST. Then w^e proceed to break the cycles in 
the new tree T U {( 2 , 2 j)|ir( 2 :,Ci) = minj{W{z, Zj)},j = 1,2, • • - ,r. We describe the 
algorithm in detail below. 

Algorithm Single- Vertex-Update(G, T, z) 

Input: The underlying graph G, the current spanning tree T and the vertex z' 
along with r edges zi,Z 2 ,---,Zr from z. 

Output: The MST for G = {V U { 2 }, E U {(z, u)lu = Zi, i = 1, 2, • • • r}) 

Step 1. Add the minimum weight edge of z to T as tree edge 
Step 2. for i = 1 to r do 

Identify the cycle formed by the edge ( 2 , zt) in T 
comment This can be done in time proportional to the length of the 
cycle induced by (z, z^) using the algorithm in previous section 
Step 3. Remove the hea\iest edge in each cycle 
Step 4. Repeat Steps 2 and 3 until all cycles are removed 
comment Step 3 is required because if cycles thus formed in Step 1 intersect 
then the breaking of each cycle fails to remove all the cycles. (See Figure 6) 
end.{Algorithm} 


We now analyze the time required by the above algorithm. The time for Step 1 is 
0(r). For Step 2, using the algorithm in the pre\dous section, the time required is 
Si where df is the length of the cycle of (z, Zi) . Step 3 also takes the same time. 
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Figure 6: Showing the formation of cycles during single vertex update. The bold edge 
coming from z is the tree edge at z and the dashed edges are the edges that induce 
cycles in T. The weights of the edges are not shown in the figure. 

But these steps have to be repeated r — 1 times before all cycles are broken. Thus 
the time for the entire algorithm is (^i)- 

2.5 Conclusions 

We have described an 0(n) work O(logn) time parallel algorithms for incrementally 
constructing the spanning tree of a general underlying graph. Our algorithm can be 
easily seen to support the query of the tj'pe w^hether the edge (u, v) is a tree edge or 
not in constant time and also query about the the w-eight of the current minimum 
spanning tree is supported in constant time. However, the algorithms do not support 
edge deletion. 
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Chapter 3 

Efficient Parallel Recognition of 
Small Induced Subgraphs 


In this Chapter we give parallel algorithms for finding small subgraphs in parallel 
efficiently. We first give constant time parallel algorithms for finding whether a given 
graph contains a triangle as a subgraph or not. This is used to recognize claw free 
graphs in O(logn) parallel time with a cost of A sequential simulation of 

our algorithm improves the cost of the existing sequential algorithm from to 

Two algorithms for listing all simplicial vertices in a graph are also given. 
Algorithms to recognize two subgraphs on six vertices are also presented. 


3.1 Introduction 

An interesting problem is to check w’hether a given graph contains a triangle or 
not. Itai and Rodeh [29] present two algorithms for this problem. The running 
time is 0(n“) and where a is the exponent of matrix multiplication. Later 

an O(m^) algorithm for this problem is given by Alon et. al. [3] Here we give 
a new analysis to this problem to show that this problem can be solved in time 
O(no^ + rri^fn) sequential time where cr^ is the \'ariance of the degree sequence of 
the given graph. For sparse graphs, m < and for graphs where the \’ariance 
of the degree sequence is small our result is an improvement of the existing result. 
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We also present 0(1) time parallel algorithms for this problem using 0[nc^ + m-fn) 
processors. 

We give parallel algorithms for recognizing claw free graphs. The algorithm runs 
in O(logn) time. The cost of the algorithm is 0(m^-^) which improves the sequential 
algorithm of Kloks e£ ai. [37]. It is shown that matrix multiplication can be used 
in algorithms for recognition of graphs with two subgraphs on six vertices are also 
given. An O(logn) time algorithm for listing all simplicial vertices of the graph is 
also given. An alternative algorithm for this problem that runs in constant time with 
a small deterioration in cost is also given. 

We note that we eliminate the bottleneck of matrix multiplication in most of 
our algorithms. This gains further importance because of the constant parallel time 
achieved. 

The rest of the Chapter is organized as follows. In Section 3.2 we present the algo- 
rithm for recognizing the existence of a triangle in a graph. This is used in Subsection 
3.2.1 to recognize claw free graphs. In Section 3.3 the existence of triangles is used 
to identify the subgraphs on six vertices. Finally, Section 3.4 gives the algorithms for 
listing all simplicial vertices of a graph. 

3.2 Recognition of Triangle 

Itai and Rodeh [29] gave the following algorithm for checking whether a given graph 
contains a triangle. Let M be the adjacency matrix of the graph. Compute the 
square of the adjacency matrix. M^(u, v) = 1 iff there exists a vertex w such that 
M(u,w) = 1 and M(w,v) = 1. If M(u.u) = 1 then there exists a triangle passing 
through the edge (u,u) of the graph. Let B = A/'^ AXP AI, the boolean bit wise 
.4.YD of corresponding entries of and AI. The (u.v) entrv' of B contains a 1 iff 

there is a triangle through (u, v). 

A parallel algorithm for this can be made to run in O(logn) time using 0(n ) 
processors. This is because matrix multiplication can be done in O(logn) time on 
CREW PR.AM using 0(n®) processors [13]. ^^'e give an alternative algorithm that 

runs in constant parallel time. 
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The algorithm is to check at each vertex whether there is a triangle containing this 
vertex. Thi.s can be done in time 0{d{v)-) where d{v) is the degree of the vertex v. 
So in the sequential case, the time is We now bound this expression 

as follows. 

From statistics, 

n 

where a is the standard de\iation and X is the mean of the sample xi,a; 2 r • • of 
size n. If we let each Xj be the degree of vertex i of the graph then X = 

Thus, 




2 d(i’)- - 2d(u)A' + (A')2 
vev 


E <‘( 0 ? - 

v€V 


4m^ 

n 


Hence we can bound the sequential time by 

Zd(vf = na^ + ^ 

v€V ^ 

Observe that Xu€V^(t^)^ ^ where Xmax is the maximum 

degree of any vertex of G. 

The best algorithm Alon [3] takes time O(m^). The cost of our algorithm is 
0{na^ + Our algorithm has less cost if m < and < n°‘ Currently, 

a SB 2.4. Thus our algorithm is better if m < and cr^ < Moreover For random 
graphs, Bollabas [6] has showm that the \-ariance of the degree sequence equals the 
mean of the degree sequence. Thus, cr^ = ^ and hence for random graphs the cost of 
our algorithm would be O(^). But form the parallel algorithmic point of view our 
algorithm runs in constant time w’hereas a direct parallelization of the algorithm of 
Alon et al. [3] cannot run in time less than O(logn). This is because of the bottleneck 
of matrix multiplication [13]. Thus our algorithm is fully parallelizable even at some 
modest increase in cost in some extreme cases. The cost of our algorithm exceeds the 
cost of the algorithm of [3] only when m > with the value of a « 2.4. As will 
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Figure 7: Claw graph. The vertex v of degree 3 is called the central vertex. 

be shown in the next subsection, our constant time algorithm is useful in some cases 
and results in a constant time parallel algorithm for recognition of claw free graphs. 

.4 constant time parallel algorithm for this problem is described below. At every 
vertex we allocate d{v)‘^ processors. The computation at every vertex v is to check 
for all pairs of neighbours of u, say u, w, whether (u, w) € E. Thus we can detect 
the presence of a triangle in a graph in constant parallel time. The graph can be 
represented in the form of adjacency lists. The following theorem follows from the 
preceding discussion. 

Theorem 3.1 Given a graph G = {V,E), it can be checked in 0(1) parallel time 
whether G contains a triangle or not ® 

3.2.1 Recognition of Claw Free Graphs 

In this section we give a parallel algorithm for recognizing claw free graphs. The 
claw is the graph shown in Figure 7. The importance of the claw graph comes from 
matching properties, line graphs, comparability graphs etc. Any graph contaming 
claw as an induced subgraph cannot be transitively oriented. Thus claw' is a subgraph 
of incomparability graphs [24]. A graph is said to be claw free if it does not have an 
induced subgraph isomorphic to a claw (see Figure 7). Wo call the vertex of degree 
3 in a claw as the central vertex. The following theorem characterizes the claw free 

graphs [7, 37). 

Theorem 3.2 IfG is a claw free graph then every vertex can have degree atmost 2>/e. 
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Proof: Consider a vertex n. If G is daw free then G,v[„] is triangle free. Let 

p = 1/V[x]|. From Turdn s Theorem, [7], a graph with p vertices and without triangles 
can have utmost f edges. Hence there must be atleast lp{p - 1) - \p^ = Ip^ - Ip 
edges in Then G,v[b] must contain atleast ~ 2 P P — edges. This can 

be atmost the number of edges of G. Hence p < 2 s/m. I 

Thus the recognition algorithm proceeds as follow's. Compute the degree of e\'ery 
%'ertex v. If there are vertices of degree greater than 2y/e then the graph is not claw 
free. In this case the graph contains a claw with some vertex of degree 3 as the central 
vertex. Otherwise for every vertex consider the graph induced by complement of its 
neighborhood and check for the presence of a triangle in the resulting graph. If no 
such graph contains a triangle then the original graph is claw free. 

The time for constructing the complement of the neighborhood is constant. The 
cost is dominated by verification of triangle in the complement of neighborhood of 
every vertex v. Let C{m, n) be the cost of the entire algorithm. 

Then, 


C(m, n) 


= 

W6V 

< Y^d{v)^/m 

vicV 

< ms/m 


In the above equations the first inequality follow’s because ever}' vertex has atmost 
2 s/m neighbours and we are using the constant time triangle recognition algorithm 
of Section 3.2. The rest of the equalities follow easily. 

Thus the algorithm runs in O(logn) parallel time. The algorithm we presented 
in this section improves the sequential algorithm of [37]. The algorithm of Kick’s et 
al. [37] takes time O(m^) whereas our algorithm has only cost. In parallel 

setting our algorithm requires 0 (log n) time. We are able to eliminate the bottleneck 
of matrix multiplication in the case of parallel algorithms. The cost of the algorithm 
matches the cost of the sequential algorithm. 
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Figure 8: The graph /Cg. It is a incomparability graph on six vertices. 

3.3 Subgraphs on Six Vertices 

In this section we give recognition algorithms for two subgraphs on six vertices. This 
gives algorithms for recognizing graphs that are free of the forbidden subgraphs also 
which we are considering. The algorithms are based on matrix multiplication. 

3.3.1 Recognition of ICq 

The graph IC$ (See figure 8 is another forbidden subgraph of comparability graphs 
[24]. 

We give the following recognition algorithm for JCg. 

.\lgorithm Recognize-ZCeO 
begin 

Step 1: Let .4 be the adjacency matrix of the graph 
Step 2: Compute the square of the adjacency matrix into B 
Step 3: Compute T = B A A. 

comment From [29] T{u, v) = l implies there is a triangle through (u, v) 

Step 4: For each triangle in the graph do 

for each edge (u, v) of the triangle do 

compute the list L{u,v) = {w\u, v and w form a triangle in G} 

Step 5: From the lists computed above check for ICe at every triangle {uvw) by checking 
whether there is another triangle through (u, u), (u. w) and (u, w) 
end. {Algorithm} 

The time taken by the above algorithm is 0(n“ + where A is the maximum 
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dcgiee of G. Ihe term n® is for Step 1 and the term n^A is the time requirement of 
Step 4. In the case of parallel algorithm, the time required will be O(logn) and the 
cost is 0(n® + n^A). 

3.3.2 Recognition of Mq 

.\ k — tree is a graph that can be reduced to the k— complete subgraph Kk by a 
sequence of operations of “pruning a leaf’, that is, ^emo^•al of a degree k vertex 
with completely connected neighbours. A partial k - tree is any subgraph of a it - 
tree or, equivalently, a graph embeddable in & k — tree with the same vertex set 
[f)]. The motivation for studying partial k — trees comes ‘from many applications 
such as reliability of communication networks in the presence of constrained line 
and site failures, concurrent broadcasting in a common medium network and other 
applications. It is sho%\Ta that many problems that are iVP— Complete for general 
graphs have linear time algorithms for partial k — trees when k is fixed. 

A graph H is said to be a minor of graph G iff it can be obtained from G by a 
finite sequence of edge-extraction and edge-contraction operations. Given a graph G 
and an edge e of G, edge extraction results in the graph G — e with the same vertex set 
SIS G and the edge set E{G) — {e}. Edge Contraction results in the graph G/e with 
the same vertex set obtained from the vertex set of G by replacing the end points of 
e by a new “composite" vertex. This composite vertex inherits all the neighbours of 
the two replaced vertices without introducing a self-loop or multiple edges. 

It was conjectured by Wagner (see [33]) that for every infinite set of graphs, one of 
its members is a minor of the other. Thus, every class of graphs that is closed under 
minor taking has a complement with a finite set of forbidden minors. It was argued 
in [5] that the graph Me is a forbidden subgraph of 6 vertices for the class of partial 
3 trees [5]. The graph Me is shoum in Figure 9. We give a recognition algorithm for 
the graph Me- The algorithm is very similar to the algorithm for recognition of ICe- 

-Algorithm Recognize-A/eO 
begin 

Step 1: Let A be the adjacency matrix of the graph 

Step 2: Compute the square of the adjacency matrix into B 
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Figure 9: The graph Mq. This is a forbidden subgraph for partial-3 trees on six 
vertices. 

Step 3: Compute T = B A .4. 

comment From [29] T{u. v) = 1 implies there is a triangle through (u, u) 

Step 4; For each triangle in the graph do 

for each edge {u. v) of the triangle do 

compute the list L(u, v) = {wju, v and w form a triangle in G} 

Step 5: From the lists computed above check for Me at every triangle 
comment This is done as follows. At every triangle {uvw) check whether there are 
triangles from (u, y), (u, w) and (y, lo) other than this triangle. From all such triangles 
check again whether there is a triangle connecting the vertices a, b and c (See Figure 9) 
end. {Algorithm} 

The time taken will be O(logn) with cost 0(n“ + n^A). This can be proved as in 
subsection 3.3.1. The algorithms in this section are greatly simplified by knowing the 
presence of triangle through a given edge (u, v) of the graph G. 

3.4 Listing All Simplicial Vertices 

For many graph problems simplicial vertices can be safely removed, tackling the 
problem on the reduced graph. Thus it is desirable that we identify all the simplicial 
vertices efficiently. We formally define simplicial vertices below. 

Let N{v) be the set {icKu. ic) € E} and N[v] = N{v) U {u}. That is, N{v) is the 
neighbourhood of the vertex v and ^[u] is the closed neighbourhood of v. We denote 
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tho degrop of the vertex defined as |iV(u)l by d(v). 

Definition 3.1 A vertex v in a graph G is called a simplicial vertex if it’s neighbour- 
hood N(v) is complete. | 

Lemina 3-1 A vertex v is simplicial if and only if for all neighbours w of v, iV[i;] C 

iY[u;]. 


Proof; For the ‘if part let v be simplicial vertex and N[v] 2 N[w] for some 
w € V' and (y, w) € E. Then, it should be the case that there is a vertex u such that 
{v,u) € E and {n,w) ^ E. Thereby, by Definition 3.1, v is not a simplicial vertex 
resulting in a contradiction. 

For the ‘only if’ part, it can be observed that as for all neighbours w of v iV[u] C N[w] 
the neighbours of v must form a complete subgraph. 

Hence the proof. I 

To obtain a parallel algorithm for listing all simplicial vertices we proceed as 
follows. A sequential algorithm for this problem appears in [37]. Let A be the 
adjacency matrix of the given graph with diagonal entries set to 1. Find the square 
of the adjacency matrix, .4^. The following observation holds. 

Observation 3.1 


A%v] = |.Y[u] n iV[u]l 


I 


Also, from Lemma 3.1 for all simplicial vertices u of G we have 
Observation 3.2 

l.Y[u]niV[y]l = l.Y[u]l 

I 

From the Observations 3. 1.3.2 the algorithm is complete after the following step. 
If >i^(u, v) — A^(u, u) for all neighbours v of a vertex u then u is a simplicial \ertex. 
The work for this part is 0(m) at d{v) processors per each vertex. Thus the total 
cost id 0(n®). The time taken would be O(logn). The following theorem follows. 


28 



Theorem 3.3 I he set of all siniplicial vertices in a given graph can be obtained in 
f?{logn) parallel time using 0(n®) processors on the CREW model of computation. 

Tho above algorit!mi can be improved as follows. Let A be a parameter to be 
chosen later. Classify the vertices of the graph as low degree or high degree vertices 
depending on whether d{v) < A or d(v) > A. Let L be the set of low degree vertices 
and H be the set of high degree vertices. The following observation holds. 

Observation 3.3 If a vertex v of high degree is simplicial then all its neighbours are 
of high degree. 

Proof: If on the contrary, v is a high degree simplicial vertex and has a low degree 
neighbour w, then by Lemma 3.1 \N[w] > liV[u]l = d{v) + 1 > A + 1. Thus w would 
also be classified as a high degree vertex. I 

Thus we obtain the given below. 

.•Algorithm List-Simplicial-Vertices{) 
begin 

Step 1; Classify vertices into low degree and high degree vertices; 

Step 2: Search all simplicial vertices of low degree by explicit checking; 

Step 3; In the graph Gh mark all vertices that are incident to a low degree neighbour; 
Step 4: Search al simplicial vertices of high degree using matrix multiplication; 
end. {.Algorithm} 

The above algorithm can be implemented in parallel O(logn) time. The cost is given 
by 0(mA + (f )“) as Step 2 takes 0{Z,^Ld{v?) < 2Am and there are atmost^ 
vertices of H and matrix multiplication costs 0((x)“)- of A = 

the cost of the entire algorithm is 

.■Alternatively, a constant time parallel algorithm for this problem can be given 
along the lines of Section 3.2. The idea is to assign to each vertex 0(d(u)2) processors 
which check the adjacency of each pair of neighbours of that vertex. Thus the number 
of processors used would be d{vf which can be bounded as in Section 3.2. 
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Chapter 4 

On the Parallel Complexity of 
Maximal Matching 


Presently, optimal parallel algorithms for computing a maximal matching in a general 
graph as well as bipartite graph take same time. In the sequential case for the un- 
weighted maximum matching problem the best known algorithms for general graphs 
and bipartite graphs take the same time 0{m\/n). But the algorithm for bipartite 
graphs is based on network flow techniques and is much simpler than the algorithm 
for general graphs which is based on a theor>' of alternating paths and blossoms. 

So, there is evidence to conjecture the existence of a simpler and/or faster optimal 
parallel algorithm for maximal matching in bipartite graphs exists. We show that such 
an algorithm results in a improvement in the case of general graphs also. « 

4.1 Introduction 

A matching in a graph C? is a subset of edges M such that no two of them share a 
common end point. A matching .1/ is maximal if .1/ is not a proper subset of another 
matching. In this Chapter we relate the complexity of computing a maximal matching 
in a graph to the complexity of computing a maximal matching in a bipartite graph. 

The parallel computation of maximal matching is a well studied problem. Israeli 
and Shiloach [28] gave an randomized O(logn) time algorithm and a deterministic 
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OClog'"* n) tune algorithm with 0{m+n) processors. Han [26] showed that the problem 
ha.s an r)(loR= n) time algorithm with 0{(m+n)/ log°-^ n) processors. This algorithm 
i.s subt>ptimal. The cost of the algorithm is 0((m + n)log^n). Han [26] also gives an 
*^0 thue r)(A/(ii)) processor algorithm where M{ti) is the processors required 
to multiply two n x n matrices. Currently M{n) « Tt^-\ Kelsen [35] gave a O(log^n) 
time optimal algorithm for general graphs and 0{log^n) time optimal algorithm for 
bipartite graphs. This was later improved by Han [25] to O(log^ n) optimal algorithm 
for general graphs, 

4.2 Reduction 

We give a decomposition scheme for decomposition of a given graph G = (V, E) into 
a set of edge disjoint bipartite graphs. The result is a generalization of the result of 
Kelsen [35]. 

Lemnoa 4.1 Any graph G = (V, E) can be decomposed in to 0(logC'„(n,m)) edge dis- 
joint bipartite graphs if the graph G is validly vertex coloured using C'v(n,m) colours. 

Proof: Let a vertex colouring algorithm colour the vertices of the graph us- 

ing C - v{n, m) colours. Each colour can be interpreted as a bit string of length 
0(logC„(n, m)). We refer to the colour of vertex v as Cv{v). 5u(u) may refer to the 
bit string corresponding to Cv{v). We now partition the edge set such that the edge 
e = (u, v) is kept in the subgraph i = min bit of 5„(u) 0 Sv{v) = 1}. With this 
we can obtain a 0(logC„(n, m)) bipartite decomposition of G. I 

Such a decomposition as a bipartite decomposition of G and the number of sub- 
graphs obtained is called the size of the decomposition [35]. Our reduction is according 
to the following algorithm. 

Step 1: Number the vertices of G from 0 to n — 1. 

Step 2: Partition the edges such that e = (u,u) is kept in the graph Gi iff i = 
mm{j\jth bit of U0U = 1 }. 

comment: this is to obtain the bipartite decomposition of size O(logn) 

Step 3: Run prefix sums on each Gi to count the size of each Gi. 
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Figure 10: Merge does not give a maximal matching 


Step 4. Use optimal algorithm for bipartite graphs to find a maximal matching in 
each Gi in parallel. 

Step 5. Obtain a set of matching edges by colouring the graph using O(logn) colours 
with all edges of maximal matching from subgraph i coloured with i. 

Step 6. Merge the maximal matchings of the subgraphs pairwise 

Kelsen [35] has shown that Steps 1-3 can be done in 0{logn) optimal time. The 
merging process of Step 6 is described in detail below. Let Gi and Gj be any two 
subgraphs and -1/i and Mj be the maximal matchings of these subgraphs. In AliUMj, 
each vertex has atmost 2 edges incident at it and one is coloured red and the other 
blue. Since the graph is of bounded degree, the computation of maximal matching is 
optimal and within O(logn) time. The graph can be decomposed into list graphs in 
constant time. Now list ranking can be used to 2-color the edges of the graph. Let the 
colours used be C\ and co. Take edges of colour Ci and obtain a maximal matching. 
The time is 0{logn) optimal. 

But the above algorithm does not gaurentee that the set of edges so obtained is 
a maximal matching of the original graph. This is so because during Step 6, some 
vertices which were matched in the subgraphs earlier may now become free. There can 
be some free neighbours for these vertices and the matching may fail to be maximal. 
(See Figure 10.) Thus we need to enlarge the matching we have obtained so far to 
get a maximal matching. For this we proceed as follows. Let the set of edges in 
the matching obtained be M. Remove all edges which share an end point with some 
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Figure 11; The bad case. The vertices a,b,c will not be covered by the matching 

edge in M and remove all isolated vertices. Let F be the set of vertices that are 
free originally, that is before merging. Let S be the set of vertices that became free 
due to merging. Consider the graph = S, E') where E' consists of the edges 
of the original graph between the vertices of F and S. This graph is bipartite by 
construction with F and S being the bipartition. Run bipartite maximal matching 
algorithm on this graph. Update M as M U M' where M' is the maximal matching 
returned by the bipartite maximal matching algorithm for the graph G'. 

Step 7. Obtain a bipartite graph with the bipartition (F, S) and use bipartite 
maximal matching algorithm to obtain a maximal matching 

But even this step does not ensure that the maximal matching is computed. The 
matching may fail to cover vertices of S that do not have any free neighbours. (See 
Figure 11.) 

As the number of such vertices can be as high as the number of the even paths, 
the number of vertices in the auxiliary graph can be about a constant fraction of 
the number of vertices of G. The number of edges also reduce by a constant fac- 
tor. So we use the above scheme for 2 log log n iterations. As problem size reduces 
to for some constant c, we use the nonoptimal algorithm. The time 

required will be reduces to log" n) = 0(Tn -!- n) as the 

cost of the nonoptimal algorithm is 0((m -H n) log^ n). 

Step 8. Use the nonoptimal algorithm to compute the maximal matching in the re- 
maining graph 


We now analyze the time for the entire schema. The computation of M takes 
time tt(n, m)+ logn. The computation of maximal matching in G' takes again time 
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t6(n, m). This is repeated for 21oglogn iterations. We need another tso{n,m)tmie 
to obtain the maximal matching of the original graph. Thus the total time is 
0{tb{n,m)lo^\ogn + tso{n,m)). Hence the following theorem follows. 

Theorem 4.1 If tb(n,m)is the time taken for computing a maximal matching in a 
bipartite graph and there is an algorithm A to compute a maximal matching in a 
general graph which is polylog arithmic away from optimality then there is an optimal 
algorithm that runs in time 0{tb{n,m)\og\ogn + tso{n,m)). I 

4.3 Final Remarks 

From Theorem 4.1, we have obtained an equation relating the parallel complexity of 
maximal matching in a bipartite graph and a general graph. Currently, tb{n,m)= 
O(log^n) [35] and tsoin,m)— O(log^’^n) [26] and is nonoptimal. Also, Han [25] gave 
an optimal O(log^n) algorithm. This, we believe is rare as the time complexity of 
bipartite maximal matching is same as that of general graph. So any improvement 
in the complexity of the bipartite maximal matching problem can be made to reflect 
in improvement in the time complexity of the general graph. 

.A.lso, we can tighten Theorem 2 of Kelsen [35] as follows. The following theorem 
is from Kelsen [35]. 

Theorem 4.2 ([35, Theorem 2])J/ there is a parallel algorithm that computes a 
maximal matching in bipartite graphs in time t(n) = fl(logn), then there is a parallel 
algorithm that computes a maximal matching in a general graph in time 0{t{n) logn). 
If the first algorithm is optimal so is the second. 

Using Lemma 4.1, we can tighten Theorem 4.2 and replace the term 0(t(n)logn) 
by 0{t{n) loglC(u)l) where |C(t/')| is the number of colours that the vertex colouring 
algorithm uses. The only requirement we insist on the vertex colouring algorithm is 
that it should be optimal. This removes our dependence on the nonoptimal algorithm 
for maximal matching from Theorem 4.1. 
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Chapter 5 

Parallel Edge Colouring of Graphs 


Linial [40] has given a nonconstructive distributed algorithm to find an 5A^logn 
colouring in one round and had left the problem of constructing explicit decomposition 
open. This has been solved in this chapter by gi%‘ing an A-logn edge colouring in 
constant time. Further, if we are provided with a vertex colouring using k colours we 
show how to reduce the number of colours to O(A-logfc). This was also stated in 
Linial [40] but the proofs are nonconstructive. 


5.1 Introduction 

Graph colouring, vertex colouring as well as edge colouring, is amongst the most 
extensively studied problem in graph theory because of its applications in schedul- 
ing, optimi 2 ation and other related problems in operations research. An example 
of application of edge colouring is the file transfer problem in computer networks. 
This problem was introduced by Coffman et al. [11]. In this problem, every node 
in the network has a communication port and there are several files to be transfered 
in between the pairs of nodes. The problem is to schedule the file transfers so as to 
minimi2e the total time taken for ail the transfers. 

In the edge colouring problem we seek to assign colours to edges of the graph such 
that no two edges sharing a common end point get the same colour. The minimum 
number of colours for which such a colouring exists is called the edge chromatic 
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Figure 12: A 4-edge colouring of graph G. The numbers at each edge represent the 
colour assigned to that edge. 

number of the graph and is denoted by ^ (G). An example of edge colouring is shown 
in Figure 12 using 4 colours. Clearly, x^(G) > A where A is the maximum degree 
of any vertex of G. Vizing [31, 19, 42] and Gupta [42] showed that x' < A -h 1. This 
is popularly known as Vizing’s theorem. Thus any graph can be edge coloured using 
either A or A + 1 colours. Graphs with y(G) = A are called as Class 1 graphs and 
graphs with ^(G) = A + 1 are called Class 2 graphs [19]. The problem of deciding 
whether a graph belongs to Class 1 or Class 2 is called the classification problem of 
graphs [19]. This problem is shown to be NP-Complete even when restricted to cubic 
graphs (A = 3) by Holyer [27]. Thus edge colouring of graphs with optimal number 
of colours is computationally difficult. 

Approaches like restricting to a particular type of graphs to get optimal colouring 
or to settle for a near optimal edge colouring are followed and have yielded good 
algorithms on the sequential model of computation. On the parallel model of com- 
putation, namely PRAM(described briefly later in Section 5.2), XC algorithms for 
edge colouring planar graphs and bipartite graphs exist [10, 9, 38]. But A -r 1 edge 
colouring arbitrary graphs is not known to be in NC. Existing algorithms [34, 39] 
have time requirements in 0{^^polylog{n)) for A: > 0 and hence belong to XC only 
if the maximum degree of the graph is of the order of (logn)* for some constant k. 
Otherwise they take time proportional to 0((A)*). Purer and Ragha\’achari [23] gave 
an A^ approximate edge colouring algorithm for arbitrary graphs in 0(log* n) time. 

In this chapter we give an A^ log n edge colouring algorithm. We also show that we 
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can reduce the number of colours to if we are provided with a valid vertex colouring. 

5.2 Definitions 

-A.n edge colouring of a graph is a mapping from C : E N. An edge colouring is 
said to be valid iff no two edges in the graph incident at a same vertex get the same 
colour. A colouring that uses k — ICj colours and is valid is said to be a A:— colouring 
of the given graph. The minimum number of colours required to validly edge colour 
the given graph is called the edge-chromatic number or the chromatic index of the 
graph and is denoted by x^- A. colouring is said to be minimal if it uses the least 
possible number of colours over all valid colourings. Any sdge colouring of G is 
called an optimal edge colouring. For sake of convenience, we represent colours by 
integers. 

5.3 Edge Colouring Graphs in 0(1) Time 

In this section, we consider the edge colouring of general graphs. As was mentioned 
in Section 5.1, the problem of A + 1 edge colouring general graphs is not known to be 
in NC. Many graph problems that did not have efficient parallel algorithms are solved 
efficiently when restricted to bipartite graphs [35, 25]. We employ this approach to 
reduce the edge colouring problem of arbitrary graphs to edge colouring bipartite 
graphs at the cost of extra colours. 

We make use of the generalized bipartite decomposition of a given graph into edge 
bipartite graphs introduced in Chapter 4. We recall it in the following Lemma. 

Lemma 5.1 Any graph G = (V' E) can be decomposed in to 0(logC„(n, m)) edge dis- 
joint bipartite graphs if the graph G is validly vertex coloured using C„(n, m) colours. 

Proof: Proof is given in Chapter 4. I 

We now state a result regarding edge colouring bipartite graphs. The following 
Theorem is from Purer and Raghavachari [23]. 
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Theorem 5.1 A bipartite graph G can he validly edge coloured using O(A^) colours 
in constant time using 0{n + m) processors. 

Proof: The proof is based on the fact that if we have a bipartition of the graph 

G, then each edge can be treated as a half edge and every vertex colours the half 
edges incident at it using a palatte of size A„. Combining the colours of each half 
edge gives a valid O(A^) edge colouring of G. 1 

We now describe our algorithm for obtaining a constant time algorithm for edge 
colouring of graphs. 

.Algorithm Edge-Colour-Graph(G) 

Input: A graph G = (V, E) 

Output: A G(A^ logn) valid edge colouring of G 
begin 

Step 1. Obtain a 0{logn) decomposition of G as given below 
Step 1.1. number the vertices from 0 to n — 1 
Step 1.2. place edge e = (u, u) in subgraph i if i = min{j\ bit j of 
u©u = 1} 

comment: Each Gu i = 0, 1, • • • ,logn is bipartite from the construction . 

Step 2. Colour each subgraph in constant time using Theorem 5.1 
comment: Each subgraph can be coloured using A^ colours where A 
is the maximum degree of G. 
end. {Algorithm} 


The constant time follows because we do not run prefix sums after the decom- 
position. Instead, we assign a processor to each edge and to every vertex. The 
computation at each edge is done with the processor assigned to it. As the maximum 
degree in each subgraph is atmost A, the number of colours used would be, 

logn 

iV(n, m) < £ A^ 

1=0 

< A^logn 
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Thus we give an explicit algorithm to edge colour a graph in constant time using 
atmost A^logn colours. 

In the following w'e assume that we are pro\’ided with k valid vertex colouring 
algorithm and reduce the number of colours required by the edge colouring algorithm. 
.A.fter having a k vertex colouring, we can get a O(logfc) bipartite decomposition of 
G (From Theorem 4.1.) Thus the number of colours used would be 

logfc 

N{n,m) < ^ 

1=0 

< A^ log k 


From the above equation, edge colouring of graphs is equh'alent to vertex colouring 
at the cost of extra colours. 

5.4 Edge Colouring of General Graphs 

In this section we give an algorithm for edge colouring general graphs. The algorithm 
us(?s edge colouring algorithms for bipartite graphs. We assume that there is an algo- 
rithm that can colour bipartite graphs in 0{tb{n,m)) time using C6(n,m, A) colours. 
The algorithm is assumed to employ a linear number of processors. The algorithm 
for general graphs follows. 

.Algorithm Edge-Colour-General-Graph(G) 

Input: A graph G = (1' E) in adjacency list representation 

Output: .\ valid edge colouring of G. 

begin 

Step 1. Obtain a O(logn) bipartite decomposition of G 
comment: This takes O(logn) time and 0{m + n) work [35]. 

Step 2. Colour each bipartite graph independently in parallel using Cb{n,m) colours. 
Step 3. Combine the colours to produce a valid colouring of G. 
end. { Algorithm } 


39 



The number of colours used and the time requirements me given in the folloTsing 
Lemmas. 

Lemma 5.2 In the O(logTi) decomposition of G, the degree of subgraph i is utmost 
A} where A is the maximum degree ofG. 

Proof: The proof follows from the observation that there can be atmost 2* vertices 

that differ in LSB position i. I 

Lemma 5.3 The time taken by the above algorithm is 0(logn + t6(n, m)). 

Proof: The time for obtaining O(logTi) bipartite decomposition is O(logn) using 

0(m4-n) work [35]. Step 2 takes 0{tb{n,m)) time. Hence the total time is 0(logn+ 
4(n,m)). I 


Lemma 5.4 The algorithm uses Cb{n, m, 2*) log A + Cb{n, m, A) log(^) colours. 

Proof: By Lemma 5.2 the degree of each subgraph is atmost min{2*, A}. Hence if 

we assume that the bipartite graph edge colouring algorithm uses cij{n, m, A) colours, 
t he number of colours used for the general graphs would be 

logn 

.V(n, m) = C6(n,m,min{2*, A}) 

jsO 

log ^ log n 

= C6(n,m,20+ ^ Cb{n. m, A) 

i=log A 

71 

- C6(n, m, 2*) log A + C6(n,m, A) log(—) 



5.5 Conclusions 

In this Chapter, we have given constructive proofs for twm results stated by Lmial 
[40]. Further, the constructions and algorithms are very' simple. 
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Chapter 6 


Minima in Sublinear Time 


6,1 Introduction 

The problems involving integers drawn from a restricted universe have received con- 
siderable attention. Integer sorting [2, 4, 36], for example, is very well studied and 
better algorithms in sequential as well as parallel settings are still sought. Data struc- 
tures that support priority queue operations in less than O(logn) time per operation 
are also devised [15], [16]. In parallel setting for related problems like integer prefix 
(computation good lower bounds are also proved by Chaudari and Radhakrishnan [8]. 
The bound proved is 0(a(n)) using a linear number of processors. 

In this Chapter we consider the problem of finding the minima of n integers 
x'l . • • • , Xn from the universe C/ = [0 • • • m - 1]. Although the information theoretic 

bound of 0{n) is established the universality of the bound differs greatly. By enhanc- 
ing the machine instruction set to include bit-wise operations like shifting and logical 
operations like AND, OR etc. it is shown that for problems like integer sorting it is 
possible to beat the information theoretic lower bound [22]. 

In this Chapter we show that using a machine model that provides support for 
bit-wise operations and logical operations, we can cross the lower bound of 0(n) 
for the problem of finding the minimum of n integers. The bound established is 
0 (ti/ logn) in the sequential case. The main idea we use namely packing of integers 
in a single machine 'word is excessively used. In integer sorting of [2] a variant of 
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bitonic sorting is used and the ability to pack integers results in a better time than 
0(rz log n). Similar techniques -u-ere also used in [4, 22]. 

Kirkpatrick [36] introduces the notion of conservative and nonconservative nature 
of the algorithms. An algorithm is said to be conservative if it uses a word size of 
0(logm + logn) for n integers taken from the universe U = [0 • • • m - 1]. Otherwise 
the algorithm is nonconservative. Our algorithm is nonconseiv^ative. Although integer 
sorting problem is well studied on the conservative as well as the nonconservative 
models of RAhl’s and PRAM’s not much is known about the bounds of other problems 
on the nonconservative PRAM wherein we can exploit the word parallelism to obtain 
better bounds [14]. 

The rest of the Chapter is organized as follows. In section 6.2 we present a 
sequential algorithm to compute the minima of n integers in 0{n] logn) time thereby 
proving our main result. In section 6.3 we give a parallel version of our algorithm 
that runs in O(logn) time and having a cost of 0{n/k) with word size of 0{k*logm). 
In Section 6.4 discusses some issues relevant to the performance of the algorithm. 

6.2 Sequential Algorithm 

We first give the format of the machine and the machine word that we make assume. 
The machine supports the operations LSHIFT and RSHIFT defined below. LSHIFT, 

X = y « z means that the value obtained by left-shifting y by z times is assigned 
to X. Observe that the value z is available as a pre-compuied constant or program 
generated value [22]. It is equivalent to multiplying y by 2^. The operation RSHIFT, 
X = y » z amounts to dividing y by 2^. Thus our machine instruction set consists 
of { +, LSHIFT, RSHIFT, .\ND, NOT } . Although this set appears to be less 
powerful than the conventional instruction set of { +, x, [/J }, it is indeed the 
case that the former introduces the power of exponentiation, if shifts are used in a 
completely unrestricted fashion, as observed by Kirkpatrick [36]. In our machine a 
word consists of k integers, for a parameter k, and is of the form shown in figure 13. 
The word has fields numbered 1 to k from right to left. The extra bits serving as 
delimiters, which we call as indicator bit can be suitably set or reset as per requirement 
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field k 

field k-1 



field 2 

field 1 

0:3^ 


m 

• # 

0:X2 

0:x 

1 


Figure 13: Format of a machine word 


and can be done by appropriate AND-ing or OR-ing. Such a word is written as [0:xfc, 

0:xfc_i, • - •, 0:xi]. 

We now present a simple routine to find the minimum of the integers container 
in two machine w'ords in O(log&) time. 

6-2.1 Minima of two words 

Our algorithm has a preprocessing phase which is described below. We need th 
constant word A'l = (1 : 0, 1 : 0, • • • , 1 : 0]. The word Kl can be generated using th 
following code [2j. Let I = flogm]. 

A'l = 1<< J ; 
for i = 1 to log/c do 

Kl = (A'l « 2‘ * 1) .A.ND A'l; 
onclfor; 

Clearly, the above routine takes 0{\ogk) time. 

We also need some masks during the running of the algorithm. Because of tl 
symmetric nature of the masks it is possible to generate them in constant time on 
one of them is generated. The masks required are of the form (1)'‘(0)'' for valu 
of r namely r = k/2. k/A, - ■ when the indicator bits are \iewed as a string of ! 
and O's. The field is to contain 0 in all the positions. We generate the mask i 
r = k/2 explicitly. Using that we show how to generate successive masks. The co 
to generate the mask for r = k/2. we call it A/1, follow's. 

/* A/1 = (l)''/2(0)*/2 */ 

/* Use A1 generated earlier */ 

.1/1 = A1 » 
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Ml = ifl « 2*/2 * 

As can be seen, this routine takes constant time. Moreover, it can be easily ob- 
served that the generation of other masks is trivial. This completes the preprocessing 
step and we now give the algorithm to find the minimum of the integers in two 
machine words. 

procedure minimum..2(A,B) 

I* A and B are the two words each containing k integers showm in the format of figure 13. 
Word A has indicator bits set to 1. 
for t = 1 to log k do 

/* subtract the words A and B */ 

K2 = A- B ; 

J* Need only indicator bits */ 

K2 = K2 AND /Cl; 

/* extract the local minima */ 

Cl = A - (/C2 AND A); 

C2 = B AND K2] 

C = Cl OR C2; 

/* .split the word C */ 

Cl = C AND Me; 

C2 = C - Cl; 

A = Cl » 2* 

B = C2; 
endfor; 
end; 

Clearly, each iteration of the above algorithm takes constant time in our machine 
model and hence the entire algorithm takes O(logfe) time. 
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6.2.2 Minima of n integers 

We now use the above algorithm to compute the minimum of n integers Xi, X 2 , • • - , 
drawn from the universe i7 = [0 - • • m - 1]. The algorithm follows. 

procedure minima_a() 
for i = 1 to n/2 * A: do 

find the minima of n/2 * fc pairs of words 
endfor; 

find the minima of the remaining n/2 * k integers by 
recurring over problem of size 0{nfk). 
end; 

The recurrence relation guiding the algorithm is T(n) < nfk*T{k) +T{n/k); with 
T(n) being the time taken to solve the problem of size 0(n) and T{k) = 0{\ogk). 
The solution for this recurrence can be seen to satisfy T{n) < 0{n/k * logk). 

Choice of k 

Different values of k lead to different algorithms. In particular if fc = O(logn) the 
time would be 0(n log log n/ log n) with a word size of O(lognlogm). But if A: = 
0{logiogn), the time would reduce to 0(n/ log n) but the word size requirement 
grows to 0(lognloglogn * logm). 

Theorem 6.1 The minima of n integers xi,X 2 ,---,Xn drawn from the range U = 
[0 • • - m] con be computed in time 0{n*\ogk/k) on a sequential machine using a word 
size of 0{k * logm), for k > 1. 


6.3 Parallel Algorithm 

In this section we give a parallel algorithm for the algorithm presented in section 6.2. 
The model of parallel computation we use is the EREW-PRAM wherein only one 
processor is allowed to either read or v,Tite to the same memory location. .A. parallel 
algorithm is judged by two parameters namely the time taken by the algorithm and 
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the amount of work it does. The amount of work is measured as the product of 
the time taken and number of processors employed. The information theoretic lower 
bound established for this problem is O(iogra) time and 0(n) work [30]. Our algorithm 
runs in time O(logn) time but does only 0(n log n/fc) work. We use the algorithm 
to find the minima of the integers contained in two words given in section 6.2.1. The 
algorithm follows. 

procedure minimaja(n) 

for n/2 * k pairs of words pardo 

find the minima of the words in 0{\ogk) time 
endfor 

recur over problem size 0{n/k) 

end 


The recurrence relation guiding the time taken by the algorithm is T(n) = 0(log fc)+ 
T{n/k) whose solution is T(n) = O(logn). The number of processors used is 0{n/k) 
and hence the work done is clearly 0{n\ognfk). ■ 

Theorem 6.2 The minima of n integers drawn from the range U = — 1] 

mn be found in time O(logn) with 0{n/k) processors on the EREW-PRAM on a 
non-conservative PRAM. 


6.4 Practical Issues 

Our main result that there is a sequential algorithm to compute the minima of n 
integers drawn from the range U = [0 ■ • • m — 1] ignores the fact that under any 
model of computation, it requires f2(n) time to read the input of n integers. But we 
can circumvent this problem if we assume that the input is instead 0{nfk) words 
each containing fc integers. This case is not too abnormal as minimum finding is a 
problem that appears as a subprobiem in many other problems and thus we need not 
charge the time of n(n) for inputting on our algorithm. Rather we can assume that 
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the superproblem indeed reads the integers one by one taking 0(n) time units but as 
part of computation needs our minimum finding algorithm as a subtask. 

In effect, we assume that the input to our algorithm consists of n/k words each 
containing k integers. Although this may appear out of context and exotic, as Al- 
bers and Hagerup [2] note “algorithms using a nonstandard word length should not 
hastily be discarded as infeasible and beyond practical rele\’ance” . We seek to find ap- 
plications of our algorithm or obtaining better algorithms for related problems which 
subsequently result in better algorithms for what we call superproblems like integer 
sorting and integer priority’ queue. 
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Chapter 7 


Conclusions 


7.1 Conclusions 

In this thesis, we proposed improved algorithms in the following cases. For the prob- 
lem of minimum spanning tree considered in chapter 2, we have improved the existing 
result of Pawagi and Ramakrishanan for the edge insertion problem from O(log^n) 
time O(n^) work to O(logn) time and 0{n) work. We have also applied the bounded 
incremental analysis of Ramalingam [44] and obtained an 0((5) time sequential algo- 
rithm for the same problem. 

As to our knowledge, we know no existing parallel algorithms for the small sub- 
graph recognition problems. There are some sequential algorithms for this problem 
[37]. For the claw free graph recognition problem, a sequential simulation of our 
parallel algorithm improves the result of Kloks et a/. [37]. Moreover, we are able to 
obtain constant time for some of the problems thereby getting algorithms that are 
fully parallelizable. 

For the maximal matching problem, Theorem 4.1 improves the result of Kelsen 
[35] to a factor of log log n. (Theorem 2 of Kelsen [35] proves that the complexity of 
maximal matching in general graphs is O(logn) times the complexity of the problem 
on bipartite graphs.) Also, our schema is optimal as that of Kelsen [35] and Han [25]. 
Any improvement in the existing algorithms for bipartite graphs will now result in a 
improvement for the general graphs as well. 
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For the edge colouring problem we gave a very fast algorithm to produce a \-alid 
log n) colouring. We also showed how to reduce the number of colours by making 
use of a generalized decomposition of graphs. 

For problems invohdng integers, integer sorting and priority- queues are well stud- 
ied. We considered the problem of minima of integers and showed that minima can 
be obtained in less than linear time in certain circumstances. 

7.2 Further Work 

In this thesis, we have proposed an bounded incremental algorithm for updating the 
minimum spanning tree under edge insertions. A related problem is to update the 
minimum spanning tree under multiple edge insertions [17]. That is, to the existing 
graph, a set of new edges are being added to the current graph. 

Obtaining better bounds on the variance of the degree sequence of a graph is 
also interesting. This would immediately reflect in improvements of the algorithms 
of chapter 3 both in the sequential case and also the parallel case. Most notably, 
improvement would be for the case of presence of triangle, recognition of claw free 
graphs and listing simplicial vertices. 

Better expression to relate the time complexities of maximal matching in bipartite 
graph and general graph are also of interest. Also the applications of bipartite decom- 
position can be generalized to hypergraphs as observed by Kelsen [35]. So it is a open 
question to obtain algorithms for computing a maximal matching in hypergraphs , 
either general or of constant dimension. A Lemma that can help in such problems is 
sated below. The Lemma is taken from Lovasz [41]. 

Lemma 7.1 At Tegular hypergraph can be split into two hypergraphs such that atleast 
one of them contains utmost ^=1 edges. 
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